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Abstract 

We introduce the A^/GI/l queue, a new queueing model. In this model, customers from 
a given population independently sample a time to arrive from some given distribution F. 
Thus, the arrival times are an ordered statistics, and the inter-arrival times are differences 
of consecutive ordered statistics. They are served by a single server which provides service 
according to a general distribution G, with independent service times. The exact model is 
analytically intractable. Thus, we develop fluid and diffusion limits for the various stochastic 
processes, and performance metrics. The fluid limit of the queue length is observed to be a 
reflected process, while the diffusion limit is observed to be a function of a Brownian motion 
and a Brownian bridge process, and is given by a 'netput' process and a directional derivative 
of the Skorokhod reflected fluid netput in the direction of a diffusion refinement of the netput 
process. We also observe what may be interpreted as a transient Little's law. Sample path 
analysis reveals various operating regimes where the diffusion limit switches between a free 
diffusion, a reflected diffusion process and the zero process, with possible discontinuities during 
regime switches. The weak convergence is established in the Mi topology, and it is also shown 
that this is not possible in the J\ topology. 

1 Introduction. 

Most of modern queueing theory is concerned with scenarios where arrival and service processes a 
stationary and ergodic. That the arrival process is a renewal process with i.i.d. inter-arrival times 
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a common modeling assumption. This is mathematically convenient as it allows full use of the tools 
that renewal theory and ergodic theory provide. However, it is not true in some queueing scenarios. 
For example, in some queueing scenarios, each arriving customer takes an independent decision of 
when to arrive. When we assume that every arriving customer draws an arrival time from the same 
distribution, this does not lead to a renewal arrival process. Moreover, such a distribution may only 
have finite support meaning that the system is transient. This scenario does not fit the standard, 
single-server models in queueing theory such as M/M/l, M/G/l, etc. 

There has been an interest in developing a theory for transient queues [15]. The first such models 
for time-dependent queues were the early attempts of Newell [23] (see also [15, 21, 9, 22]), and the 
more recent developments for Mt/Mt/1 in [18] and state dependent Markovian queues in [19]. 
However, in all of these the assumption of a renewal arrival process (albeit time-inhomogeneous) 
remains ubiquitous. Furthermore, all such models still assume a queueing system operating forever, 
with an infinite population of customers and a steady state. In contrast, many queueing systems 
serve only a finite number of customers, and in fact, the queueing system itself may be transitory, 
i.e., it may operate only in a finite window of time, meaning that the concept of a steady state does 
not exist. A goal of the present work is to propose a queueing model, and develop its theory, that 
is relevant for such transitory queueing systems. 

Such models can arise when service starts at a certain time, and customers may choose to arrive 
early. For example, when customers go to a rock concert in a Greek theatre, they may choose to 
arrive before the gates open, or arrive any time after until the gates close. Such a scenario was 
studied as the concert arrival game in [13, 11]. Other scenarios where such a model may be 
relevant include queueing outside stores for black Friday sales, outside Apple stores before new 
product launches, DMV or postal offices, lunch cafeterias, etc. Some variation of this model may 
also be relevant for call centers where customers take independent decisions of when to call and 
service time is finite (8am-5pm, for example) even though customers typically cannot call before 
service starts. Although, in some enterprise technical support call centers, cases can be put in the 
queue via the web even before service starts in the morning. 

It is easy to see that the queueing processes in such scenarios are inherently transient, and there 
really is no steady state for the system. Unfortunately, transient performance measures for queueing 
systems can be very difficult to compute, and often necessitate the use of numerical schemes. This 
makes the study of transitory queueing systems very hard. A goal of the present work is to address 
this deficiency by introducing a model, theory and approximations for a transitory queueing model. 

In this paper, we introduce a new queueing model that has a finite population of customers 
whose arrival process is not a renewal process, and in which service occurs only during a finite time 
window. In particular, it is a transitory queueing model. Consider n customer who arrive into a 
single-server queue. Each customer's time of arrival is sampled i.i.d. from a distribution F. Service 
times have a general distribution G and are i.i.d. Thus, the times of arrival are ordered statistics. 
If we denote the iih order statistic from a sample of size n from the distribution F by and 
A(j) := (-X'(j) — -^(i-i))) then in Kendall's notation [16], this model can be called the A^/GI/1 
queueing model. 

Exact analysis of this model is impossible for general service processes. Even in the case of a 
Markov service process while the forward differential equations can be written down there is no 
easy way to obtain analytical solutions to them, and one must study them numerically. Therefore, 
we develop fluid and diffusion approximations for the queue-length process, as the population size 
n increases and the service rate is accelerated. Then, we establish a transient Little's law that links 
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the limiting queue-length and virtual waiting time processes under both fluid and diffusion scaling. 

To develop our fluid and diffusion limits, we first use the Glivenko-Cantelli theorem and the 
functional strong law of large numbers to develop a fluid limit for the netput process (that is, the 
difference between the arrival process and the potential service completion process that assumes 
the server is always busy). The fluid limit for the queue- length process then follows from the 
Skorokhod reflection mapping theorem [29, 10]. We observe an interesting behavior in the fluid 
limit: it switches between overloaded, underloaded, and critically loaded periods, as time pro- 
gresses. The limiting diffusion for the queue-length process then arises as the directional derivative 
of the one-dimensional Skorokhod map for the fluid process, having input that is the diffusion 
netput process approximation. The diffusion netput process approximation combines a Brownian 
bridge, that arises from the invariance principle related to the Kolmogorov-Smirnov statistic, and 
a Brownian motion, that arises from the functional central limit theorem for renewal processes. 
We also note that our diffusion process convergence results are in Skorokhod's M\ topology on the 
space 2?ii m [0, oo), the space of functions that are right or left continuous at every point, and right 
continuous at 0. Recall that the M\ topology is weaker than the J\ topology. We further show 
that convergence for the A^/GI/l model cannot be achieved in the stronger Ji topology, and by 
implication, in the even stronger uniform topology. 

The most standard heavy traffic approximation in the literature for a single server queue is 
a reflected Brownian motion diffusion approximation; see, for example, Chapter 6 in [5] and the 
pioneering work in [12]. The reflected Brownian motion approximation is relevant for a G/G/l 
queue in which the inter-arrival and service times are either independent or only exhibit weak 
dependence, and in which both the inter-arrival and service time distributions have finite second 
moments. In the case that there is strong dependence in the inter-arrival or service times, or the 
distributions are heavy-tailed (with an infinite second moment), non-Brownian limits arise; see, 
for example Chapter 4 in [34] and the overview paper [32]. The Au\/GI/l queue has correlated 
inter-arrival times and our model has finite second moments since F has compact support. For a 
G/GI/1 queue in which the sequence of arrival times is stationary and either a zero-mean Gaussian 
process or a zero-mean linear process (so that there is strong dependence but no heavy-tails) and 
the service process is renewal, the appropriate heavy traffic approximation will involve a fractional 
Brownian motion and reflection (which follows from Theorem 4.6.1 and Theorem 9.3.1 in [34]). 
However, the A^/GI/1 queue does not fit that framework, because the inter-arrival time sequence 
is not stationary, even though the distribution from which customers sample their arrival times 
does not change with time. In other words, the An\/GI/l queue has a closer connection with 
single server queues that have a time-varying arrival rate. 

Thus, it is pertinent to compare the A^/GI/1 model we introduce to the already studied 
M t /M t /1 queueing model. One of the earliest papers on the latter model is [22]. There, a uniform 
acceleration technique was developed that scales the arrival and service rates by dividing with 
a small positive constant. The state transition probabilities are studied as the positive constant 
decreases to 0. Strong approximations for the model were later developed by Mandelbaum and 
Massey [18], wherein the scaled arrival and service processes were approximated by Gaussian pro- 
cesses on the same accelerated time scale, by leveraging well known strong approximation results 
for Levy processes. Here, we adopt a different approach by utilizing the Skorokhod almost sure 
representation theorem to establish the desired results. The fluid and diffusion limits that arise for 
the A(j)/G//1 queue parallel the fluid and diffusion limits that arise for the M t /M t /1 queue in the 
sense that (1) the fluid limit may switch between overloaded, underloaded, and critically loaded 
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periods, and (2) the diffusion limit arises using a directional derivative for the Skorokhod reflection 
map. The key difference is that our fluid and diffusion limits are formed using results for the 
convergence of the order statistics (i.e., the Glivenko-Cantelli and Kolmogorov-Smirnov theorems), 
whereas such results are not relevant in [18]. 

Transitory behaviors in queueing models however, have been proposed earlier. In the late 1960s 
[23], Newell introduced queueing models with both a time- varying arrival and service processes. He 
studied the Fokker-Planck (or heat) equation for the Gaussian process approximation to a general 
arrival process in various special cases on the arrival rate function, such as a transition through 
saturation and rush hour states. These results were later summarized in [24]. However, these 
approximations were not rigorously justified with a weak convergence result, and they are not 
"process-level" results in the spirit of the results here. In [7], the authors discuss several transitory 
demand queueing problems very much like the A^/GI/1 queue. However, they do not develop a 
comprehensive theory for such models. 

Perhaps the work closest to the current paper is [17], where the author considers the same 
setup as we have but does not allow early arrivals. The paper develops diffusion approximations to 
the queue length in separate, distinct intervals and the maximum queue length process. However, 
without establishing a "process-level" convergence over all time, such a result is rather incomplete. 
In fact, it is not difficult to derive point-wise limits to the queue length process. Establishing 
"process-level" convergence for such limiting processes in an appropriate topology is the main 
mathematical difficulty, as was also observed in [18] for the M t /M t /1 model. Also, worth mentioning 
here is the work on queueing systems with periodic arrivals studied in [8] and [2]. These models 
are, of course, distinct from the A^/GI /l model that we introduce in this paper. We note that 
our results establish a single "process-level" convergence result over all time and recover all the 
results in [23] for various special cases as well as those in [17]. 

The rest of this paper is organized as follows. Section 2 presents the Au\/GI/l queueing model 
and some basic results about fluid and diffusion approximations to arrival and service processes. 
Section 3 develops fluid approximations to the queue length, busy-time and virtual waiting time 
processes. In Section 4, we develop diffusion approximations to these processes. Section 5 takes a 
closer look at the sample paths of the queue length process in various operating regimes. Section 
6 presents distributional approximations. We then conclude in Section 8 with some remarks about 
potential future directions. In the appendix, we place proofs that are more technical in nature. 

2 Preliminaries. 

In this section, we introduce the finite population A^/GI /\ queueing model. We first provide 
pathwise descriptions of the arrival process, the service process, the virtual waiting time process, 
and the queue-length process. We next present our large populatoin asymptotic regime, and provide 
the fluid and diffusion limits for the arrival and service processes in that regime. These limits will 
be used to develop asymptotic approximations to the system parameters in Sections 3 and 4. 

2.1 Queue Model. 

Consider a single server, infinite buffer queue that is non-preemptive, non-idling, and starts empty. 
The service occurs over a finite time interval, and is first-come-first-served (FCFS) basis. There is 
a large but finite number of customers, and some may be early birds in the sense that they arrive 
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before service begins. 

Let n be the customer population size. Arriving customers independently sample an arrival 
time Tj, i = l,...,n, from a fixed cumulative distribution function F that is assumed to have 
support [-T ,T] C M, where -T < and T > 0. Thus, F(-T ) = and F(T) = 1. Customers 
enter the queue in increasing order of the sampled arrival times. The arrival process is the number 
of customers that have arrived by time t, defined as 

n 

A(t):=£l {21 < t} . (1) 

i=i 

The customer arrival times are the order statistics T^), T( 2 ), • • . , T( n ). 

Let {z/j, i > 1} be a sequence of independent and identically distributed (IID) random variables, 
where v\ represents the service time of the ith customer. We assume Ei/j = Var(f,) is finite, 
the associated cdf G has support [0, oo), and the sequence is independent of the arrival times Tj, 
i = 1, . . . , n. Define S to be the associated renewal process, so that 

S(t) := sup{m > l\V(m) < t}, Vt > 0, (2) 

where 

rn 

V(m) :=J>- 
i=l 

Note that /x is the system service rate, and S(t) can be interpreted as the number of customers 
that could be served if the server were busy all the time in the interval [0, t] . Also note that S is 
defined for all t > 0, so service starts at time in the A^/GJ/l model. We define S(t) = for all 
t < 0. 

The sum V{m) represents the amount of work (in units of time) presented by the first m 
customer arrivals. The virtual waiting time at time t, or the amount of time a customer arriving 
at time t would have to wait for service, is 

Z(t):=V(A(t))-B(t)-tl {tm , (3) 

where 

B(t) :=(J l {OW >o}<fe)l {t >o}, Vt G [-T ,oo). (4) 

In words, Z(t) is the difference between the total amount of work presented by the arrivals up to 
time t and the amount of work completed by the server by t. Note that this definition of the virtual 
waiting time varies slightly from the standard definition due to the fact that an arrival at time 
t < before service starts has to wait an extra t units of time for service to start, which accounts 
for the — il{ t <o} term. 

We also define the idle time process of the server as 

J(t) := tl {tm - B(t) =(J l {Q{s)=oy ds\ l {tm Vt G [-T , oo). (5) 

This is the amount of time the queue has been empty after service has started at time 0. 
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Finally, let Q represent the queue length process, including both any customer in service and 
all waiting customers. This is defined in terms of the arrival and service processes as 

Q(t) := A(t) - S(B(t)), Vt G [-T , oo). (6) 

In words, Q(t) is the difference between the cumulative number of arrivals up to time t ((A(t)) and 
the cumulative number of departures before time t (S(B(t))). 

We note that this model is intractable to exact analysis. Therefore, we develop asymptotic 
approximations for the queueing and virtual waiting time processes, as the population size increases. 

2.2 Basic results. 
Notation. 

Unless noted otherwise, all intervals of time are subsets of [— To, oo), for a given —To < 0. Let 
T'lim := Aim!— T), oo) be the space of functions x : [— To,oo) — > R that are right-continuous at 
—To, and are either right or left continuous at every point t > —To. Note that this differs from 
the usual definition of the space T> as the space of functions that are right continuous with left 
limits (cadlag functions). We denote almost sure convergence by —4 and weak convergence by 
=>. The topology of convergence is indicated by the tuple (S,m), where S is the metric space of 
interest and m is the metric topologizing S. Thus, X n -^A X in (TW, U) as n — > oo indicates that 
X n G T)\\ m converges to X 6 T>n m uniformly on compact sets (u.o.c.) of [— To,oo) almost surely. 
Similarly, X n X in (T>n m , U) as n — > oo indicates that X n G TW converges weakly tol£ T'lim 
uniformly on compact sets of [— To,oo). (T>n m , Mi) indicates that the topology of convergence is 
the Mi topology. X indicates a fluid-scaled or fluid limit process. X and X are used to indicate 
diffusion-scaled and diffusion limit processes. We use o to denote the composition of functions or 
processes. The indicator function is denoted by and the positive part operator by (■)+. 

We now present known functional strong law of large numbers (FSLLN) or fluid limits, and 
functional central limit theorem (FCLT) or diffusion limits, for the arrival and service processes, 
as the population size n increases to oo. Our convention is to superscript any process associated 
with the model having population size nby n. 

We start with the arrival process. Let A n := A be the arrival process associated with the system 
having population size n. The fluid-scaled arrival process is 

An 

A n := — . 
n 

Next, we accelerate the service rate so that ji n := n\i. Correspondingly, the scaled service times 
are vf := Vi/n for i = 1, . . . , n, and the accelerated service process is 

S n {t) :=sup lm> < * f" , * > 0. 

i=l 

The fluid-scaled service process is 

an . = I.cn 
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Also, the fluid-scaled offered waiting time process is 

[nt\ 

V n (t):=J2»?, Vt€[0,oo). (7) 
i=i 

The following lemma establishes the fluid limits for these processes. 
Proposition 1. As n — > oo, 

(A n (t),S n (t)l t > ,V n (t)l t >o) ^ (F(t),^l t > ,-l t >o) in (V lhn ,U). (8) 

Remarks. 1. The proof of Proposition 1 follows easily from standard results: The fluid arrival 
process limit is given by the Glivenko-Cantelli Theorem (see [6]). The fluid limits of the service 
process and the offered work process follow from the functional strong law of large numbers for 
renewal processes (see [5]). 

Next, using the fluid limits from Proposition 1, we present functional central limit theorem or 
diffusion limits, to the appropriately standardized or diffusion-scaled processes. The diffusion-scaled 
arrival process is defined as 



A n (t) := Vn\A n (t) - F(t)J Vt G [-T , oo). 
Similarly, the diffusion-scaled service and offered waiting time processes are 

S n (t) := S n (t) ~ fit], *>0 



(9) 



V n {t) := ^(v n {t)-h\ t>0. 

The following proposition presents the diffusion limits for these processes. 
Proposition 2. As n — >■ oo, 

(i n , S n , V n ) => (w° o F, afi 3/2 W o e , -afi 1/2 W o ^\ m (Aim, U), 

where W° is the standard Brownian bridge process and W is the standard Brownian motion process, 
both are mutually independent, e : [0, oo) — > [0, oo) is the identity map. 

Remarks. 1. The proof of this proposition follows easily from standard results: The FCLT limit 
for the diffusion-scaled arrival process, also called the empirical process, is a Brownian bridge by 
Donsker's Theorem (see Sections 13 and 16 in [3]). Note that this limit also arises in the study of the 
invariance principle associated with the Kolmogorov-Smirnov statistic used to compare empirical 
distributions with candidate ones (see [34] for more detail). The limits for the diffusion-scaled 
service and offered work processes follow from the FCLT for renewal processes (see Section 16 in 
[3] and Chapter 5 in [5]). 

2. Note that we have not placed any restriction on the arrival distribution F other than that of 
finite support. The proofs of the fluid limits to the performance metrics will hold for arbitrary 
distribution functions F. However, the diffusion limits require F to be absolutely continuous, since 
the form of the limit process depends on this fact. Extending the result to arbitrary F appears 
non-trivial, and left for future work. 
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3 Fluid Approximations. 

We first derive the fluid limit for the queue-length process, and next derive that for the virtual 
waiting time process. Recall that the queue length process is defined in (6). The corresponding 
fluid-scaled queue length process is 

O n (t) 1 1 

?LA± = -A n (t) - -S n (B n (t)), (10) 
n n n 

where B n (t) is the fluid-scaled version of the busy time process (4) defined as 

ft 



B n (t) := Q l ms) >o}<fa)l {t >o}- (11) 



Now, we can re- write (10) by adding and subtracting the functions F and \iB n to obtain 



n \ n j \ n 

Similarly, adding and subtracting the function /xtl{ t>0 } shows that 

Q n (t) (A n (t) 



n \ n 

where 



I n (t)=tl {tm -B n (t) 



is the fluid-scaled idle time process. Then, the fluid-scaled queue-length process is equivalently 
written as 

Q n {t) := = X n (t) + vl n (t), Vt G [-T , oc), (12) 
where X n (t) is defined to be 

X n (t) := [^f> - F(t)j - ( S " (i ^ (t)) - A^(*)) + (F(t) - ^l {t >0})- (13) 

In preparation for the main Theorem in this Section, recall that the Skorokhod reflection map 
is a continuous functional (<£>, : X>n m — >■ £>ii m x V\ [m defined as 

x i->- <J/(x) := sup (— x(s))+, 

-T <s<t 

and 

x i-)- := x + ^(x), \/x e Pli m . 

The continuity of the map with respect to the uniform topology on T)\\ m follows from Theorem 3.1 
in [20]. 

Theorem 1 (Fluid Limit). The pair (Q n ,fj,I n ) has a unique representation (<3?(X n ), ^(X n )) in 
terms of X n . Furthermore, as n — > oo, 

(Q",^) ^> ($(X),*(X)) m (P lim ,C/), 

where X(t) = (F{t) - fitl {tm ). 
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Proof. First note that Q n (t) > 0, Vt G [-T ,oo). It is also true that I n (-T ) = and <i/ n (t) > 
0, Vt G [-T ,oo). By definition of I n (t), it follows that J™ To Q n {t)dI n {t) = 0. Thus, by the 
Skorokhod reflection mapping theorem (first proved in [28]), the joint process (Q n (t), fj>I n (t)) has 
a unique reflection mapping representation in terms of X n (t) as ($(!"), *f/(X n )). 
Note that by definition of B n (t) < t and from Proposition 1, it follows that 

^0 in (V hm ,U). (14) 

Using (14) and Proposition 1 it follows that 

X n ^X in (Aim, U), 

where X := (F(i) — //il{ t >o}). Using the limit derived above and the continuous mapping theorem 
(see, for example, Theorem 5.2 of [5]), it follows that 

(Q n ,^I n ) = ($(P),*(X n )) ^ (<D(X),*(X)) in (V lim ,U). 

□ 

Remarks. 1. Note that X is the difference between the fluid limits of the arrival and service 
processes, and is often referred to as the fluid limit of the netput process. In effect, it is the amount 
of net potential fluid flow into the system. 

2. Theorem 1 shows that the fluid limit of the queue length process is 

Q(t) = (F(t) - Htl {t>0} ) + sup {-{F(s) - ^sl {s>0} ))+, Vt G [-T , oo). 

-T <s<t 

Q can be interpreted as the sum of the fluid netput process and the potential amount of fluid lost 
from the system. Suppose that (F(t) — /j,tlj t > \) < so that the fluid service process has "caught 
up" and exceeded the cumulative amount of fluid arrived in the system by time t (for simplicity 
assume t > 0). Further, suppose f(t) — n < 0, implying that the netput process is decreasing at t. 
In this case, sup_ T()<s<t (—(F(s) — fisl^ s>0 y)) + = —(F(t) — fit). This is the amount of extra fluid 
that could have been served, but is now lost. 

3. Figure 1 depicts an example queue length process in the fluid limit, and its dependence on the 
arrival distribution F and service rate \i. Note that the process switches between being positive 
and zero, during the time the queue operates. We will investigate this behavior in detail in Section 
5. 

4. Observe that idle time process (5) differs from the usual definition (see Section 6.2 of [5], for 
example) as it does not measure the idling of the queue as a whole. Another plausible definition of 
an idle time is the following 

Y(t) := f l {Q{s)=0} ds, Vt€[-T ,oo). (15) 
J o 

This measures the total time that the queue (buffer and server) has been empty. The follow- 
ing lemma shows that asymptotically in the fluid scaling there is no difference between the two 
definitions. The proof is in the Appendix. 



S n oB n 



n 



-\iB n 
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Figure 1: An example of a A^/GI/1 queue that will undergo multiple "regime changes". The 
fluid queue length process is positive on [— Tq,t\) and [t2,ts), and on [t\,T2) and [r, oo). 



Lemma 1. As n — > oo, 



jn _ Y n 



in (V hia ,U). 



It is interesting to observe that B n does not converge to the identity process. This is in contrast 
to the fluid limit for the GI/GI/1 queue in the heavy-traffic approximation setting, in which the 
fluid-scaled busy time process does converge to the identity process. 



Corollary 1. As n — > oo, 



B n ^B in (V hni , U) 



(16) 



where B(t) := tl {t > 0} - ^(X(t)), Vt G [-T ,oo). 

Proof. By definition, we have B n (t) = il{t>o} — I n (t)- This can be rewritten as 

B n (t) = tl {tm -I n (t). 
Using Theorem 1, the claim then follows. 



□ 



Note that B(t) = for all t < 0, as V(X)(t) = on that interval. 

The limit for the busy time process is useful in establishing a fluid limit for the virtual waiting 
time process (3). The fluid-scaled virtual waiting time process is 

Z n (t) = ^ n (™(^)) " B n (t)-tl {tm , Vte [-To.oo). (17) 



Proposition 3 (Fluid Transient Little's Law). As n — > oo, 



771 a - s \ 



Z in(D hm ,U), 



(18) 



where Z(t) := Q(t)//j, — tl{ t<0 y. 

Proof. First note that Z n (t) can be rewritten as 
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Recall, from Proposition 1, that V n (t) t/fi in (X>u m , U). This can be extended in a straightfor- 
ward manner to non-decreasing functions h : [0, oo) — > [0, oo), so that 

Voh^h/n in (V lha ,U). 

Now, using the random time change theorem (Theorem 5.3 in [5]) and setting h = A n /n it 
follows that , as n — > oo, 

1 A n 

V n oA n in(P lim ,C7). 

fi n 

Using Proposition 1 and Corollary 1, substituting for B(t), we have Z(t) = ^Q(t) — tl{ t < y. □ 

Remarks. 1. The fluid limit in Corollary 3 is a fluid transient Little's Law because it provides a 
deterministic relationship between the fluid queue-length and virtual waiting time processes. The 
term tl{ t<0 y accounts for the fact that an arrival at time t < would have to have —t time units 
for service to start. 



4 Diffusion Approximations. 

We derive the diffusion limit for the queue-length process in Section 4.1, and then leverage off this 
result to develop the diffusion limit for the virtual waiting time process in Section 4.2. Here, we 
assume F is absolutely continuous in order to establish the desired limit result. Throughout, we 
specialize our results to the case of a uniform F. The uniform F is important because (1) it is 
simple and so aids in understanding, and (2) in the concert arrival game in [13] the equilibrium 
arrival distribution is uniform. 

4.1 Queue Length Process. 

Define the diffusion-scaled queue length process as 

Q n (t) . A»(t) S n (B*(t)) 

— -/=- ■= — t= 7= — , vie -r ,oo) (19) 

y/n y/n y/n 

Rewriting it after introducing the term y/rifxtl 1 1 > ) , we have 

+ Vn{F(t) - /iil{ t > }) + y/nn(tl{ t > y - B n (t)). 
Using the definition of the idle time process 

we can express Q n /y/n as 

% = X n + yfrX + v^/i/ n , (20) 

In 
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where 

X n (t) := - - ~ ^(t)) (21) 

= _ 5 n (B n (t)), Vt€[-T ,oo). 

Recall from Theorem 1 that -X'(t) = — fitl t >o) is the fluid netput process. We can think of 
X n as a diffusion refinement of the netput process. Now, Lemma 2 gives a diffusion limit of X n (t) 
as a direct consequence of Proposition 2. 

Lemma 2. As n — > oo, 

X" 1 := W° o F — an 3/2 W o B in (£> lim , 17) (22) 

where B is defined in (16), and W° and W are independent standard Brownian bridge and standard 
Brownian motion respectively. 

Proof. First note that B n (t) < t, Vt € [0, oo), implying that S n o B n G Aim- Using Proposition 2, 
Corollary 1 and the random time change theorem (see, for example, Section 17 of [3]), it follows 
that 

^ — -^Ua^oB. (23) 



n 

Now, it follows from Proposition 2 and the weak limit (23) that 

X n X(t) := W° o F — a^ /2 W o B. 

□ 

Remarks. 1. Note that using a classical time change (see, for example, [14]) it is possible to see 
that the Brownian bridge is equal in distribution to a time changed Brownian motion, and X is 
equal in distribution to a stochastic integral 

A[t) ~ \-a^W(B(T)), \ft>T ' 1 J 

where 

g(t) = F(t)(l-F(t)) + a 2 ^B(t) 

and TU is a Brownian motion independent of W° and VF. Thus, the process X can also be interpreted 
as a time-changed Brownian motion on the interval [—Tq,T], and its sample path is a constant on 
(T,oo). 

In the rest of this section, we will use Skorokhod's almost sure representation theorem [28, 33], 
and replace the random processes above that converge in distribution by those defined on a new 
probability space that have the same distribution as the original processes and converge almost 
surely. The requirements for the almost sure representation are mild; it is sufficient that the 
underlying topological space is Polish (a separable and complete metric space). We note without 
proof that the space X>ii m , as defined in this paper, is Polish when endowed with the Mi topology. 
This conclusion follows from Theorem 2.6 of [31] and the fact that the proof there extends easily 
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to the case of M\ topology. The authors in [18] also point out that [25] has a more general proof 
of this fact. 

We conclude that we can replace the weak convergence in (9) by 

(A n , S n , V n ) ^ (w° o F, afi 3/2 W, - an 1/2 W o ^ in (D lim , U), 

where abusing notation we denote the new limit random processes by the same letters as the old 
ones. This implies that in Lemma 2, as n — > oo, we actually have 

X n ^X m(V lim ,U). 

Now, our goal is to establish a diffusion limit for the centered queue length process 

Q n (t) := V^(^-QW). (25) 

We achieve this by using the Skorokhod reflection mapping theorem [28, 5, 34] and express (Q n (t) / y/n, y/n^I n {t)) 

uniquely in terms of X n and X. Using this representation, we redefine Q n in terms of X n and 

X, and then establish the necessary limit as n — > oo. Note that we will establish the limit in 

the weaker topology Mi, as opposed to the more common U (uniform) or J\ topologies. This is 

because the directional derivative reflection mapping lemma (Lemma 3) we will use is only available 

for ("Dij m ,Mi). In fact, in Proposition 4 below, a counterexample is provided that shows that the 

limit result is not achievable in the stronger J\ topology. 

Recall that ($, ^) is the Skorokhod reflection map. We denote the directional derivative of the 
Skorokhod reflection map by 

sup(-y)(t) = lim ^(^x + y)(t) - y/nV(x)(t), (26) 

„y n— >oo 
Vf 

in some appropriate topology on Pn m , where x G C and y G Vy mi , and 

Vf = {-T < s < t\X{s) = -*(X)(t)}, 

is a correspondence of points upto time t where the fluid netput process achieves an infimum. 
The following theorem provides the diffusion limit for the queue length process. 

Theorem 2 (Diffusion Limit). The pair (Q n , Y n ) has a unique representation in terms of X n and 
y/nX given by 

(^>(X n + ^X) - v^Q, ^{X n + ^X) - v^^(X)^ , 

where Q = X + ^(X) is the fluid limit of the queue length process. Furthermore, as n — > oo 

(Q n ,Y n )^(X + Y,Y) m (Aim, Mi), 
where X(t) = W°(F(t)) - a^/ 2 W{B{t)), andY{t) = max sgV x (-X(s)) Vt G [-T ,oo). 
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Proof. First using (20), it follows by the Skorokhod reflection mapping theorem that 

(^=,V^fir^J = ^x n + v^x),v(x n + vtix)y (27) 

This implies that 

O n - ~ _ - 

Qn = * _ = ^ X n + _ ^8) 

V 71 



Recall from Theorem 1 that Q = X + ^f(X). Substituting this expression into (28), and using 
the fact that <&(x) = x + \I/(x), for any x £ £>ii m , we have 

Q n = X n + ^/nX + ^(X n + ^/nX)-Vn(X + y(X)), 

= X n + ^{X n + ^nX) - y/rf&{X). (29) 

Next, utilizing the expression for y/n~nl n in (27), and letting 

Y n := - VnV(X), 

we have that 

f n = ^{X n + yftiX) - y/EV(X). (30) 

Therefore, 

Q n = X n + Y n . 

The limit result now follows by use of the following directional derivative reflection mapping 
lemma which is adapted from Lemma 5.2 in [18], and whose proof can be found in the Appendix. 

Lemma 3 (Directional derivative reflection mapping lemma). Let x and y be real-valued continuous 
functions on [0,oo), and ^(z)(t) = sup 0<s<t (— z(s)), for any process z £ Aim- Let {y n } C T>n m 
be a sequence of functions such that y n ^4' y as n — > oo. Then, with respect to Skorokhod' s M\ 
topology, y n := ^ (yjnx + y n ) — y/n i S(x) — > y := sup seV i(— y(s)) as n — > oo, where Vf = {0 < s < 
t\x(s) = -Wx(t)}. 

Observe that Y n is exactly in the form of y n defined in the lemma above. Since it has been 
shown that X n converges uniformly on compact sets of [—To, oo) to the continuous process X in 
Lemma 2, applying the lemma above, it follows that 

Y n ^ Y : = max(-A») in {V lim ,M 1 ) 

Now, using Lemmas 2 and the immediate result above we have 

Q n = X n + Y n ^ X + max (-X(s)) in (A Mi). 

Finally, the weak convergence in the statement of the Theorem follows from the fact that we had 
replaced the weak convergence in (9) with almost sure convergence using the Skorokhod represen- 
tation Theorem. □ 
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Remarks. 1. Observe that the diffusion limit to the queue length process is a function of a 
Brownian bridge and a Brownian motion. This is significantly different from the usual limits 
obtained in a heavy-traffic or large population approximation to a single server queue. For instance, 
in the G/GI/l queue, one would expect a reflected Brownian motion in the heavy-traffic setting. In 
[18] it was shown that the diffusion limit process to the M^/Mj/1 queue is a time changed Brownian 
motion W(J \(s)ds + j fi(s)ds), where X(s) is the time inhomogeneous rate of arrival process and 
fj,(s) is that of the service process, reflected through the directional derivative reflection map used 
in Lemma 3. There are very few examples of heavy-traffic limits involving a diffusion that is a 
function of a Brownian bridge and a Brownian motion process. However, there have been some 
results in other queueing models where a Brownian bridge arises in the limit. In [26], for instance, 
a Brownian bridge limit arises in the study of a many-server queue in the Halfin-Whitt regime. 

2. We noted in the remarks after Theorem 1 that the fluid limit can change between being positive 
and zero in the arrival interval for a completely general F. One can then expect the diffusion 
limit to change as well, and switch between being a 'free' diffusion, a reflected diffusion and a 
zero process. This is indeed the case. Figure 2 illustrates this for the example in Figure 1. Note 
that Vt £ [-T 0j ti) $>(X)(t) = -X(-To), implying that the set Vf is a singleton. On the other 
hand, at n Vf = {-T ,ti}. For t £ (ti,t 2 ], V(X)(t) = = X(t), implying that Vf = (n,*]. 
On (t 2 ,t 3 ), V(X)(t) = 0, but X(t) > 0, so that Vf = (tl,t 2 ]. Finally, the fluid queue length 
becomes zero when the fluid service process exceeds the fluid arrival process in [t3,oo), implying 
that ty(X)(t) = — (F(t) — fit) > 0. It can be seen that Vf = {t} in this case. 

3. Recall that the idle time (5) was defined as the amount of time the server idles. In the remarks 
following Theorem 1, Lemma 1 showed that (15) is an equivalent definition of idling in the fluid 
limit. The following proposition extends this conclusion to the diffusion-scaled processes. 

Lemma 4. As n — > oo 



The proof is available in the Appendix. 
4.1.1 Why Mi, and not J x ? 

We now discuss why we establish the diffusion limit in the Polish space (T>n m ,Mi), and why it 
can't hold in the space (£>ii m , J±). This section can be skipped on a first reading without any loss 
of continuity, though we encourage the reader to read it for a better understanding of Theorem 2. 

There are several equivalent definitions of convergence in the M\ topology (the interested reader 
is directed to [28, 34, 33] for a more in-depth study.) A simple characterization of convergence 
in Mi for processes with range in M is the following involving the number of visits to a strip 
[a, (3] C R in an interval [ii,i 2 ] C [rj,oo). Let y € V (or X>ii m ) and suppose there are N + 1 
points ti < i( ) < i(i) < ... < t(jv) < h such that either y(t(o)) < a,y(t^) > f3,y(t( 2 )) < <*>■■■> 
or y(i(o)) > /3,y(t(i)) < a,y(i( 2 )) > /?,••• Then, there are N visits to the strip in [ii,t 2 ]. Let 
v ^ft 2 -\{y) 1 — y N be the number of visits to the strip [a, /3] in [t±, t 2 ] by the function y. Definition 1 
summarizes this characterization [33]. 

Definition 1 (Convergence inM±). Lety,y n be elements ofV and oIm 1 (-, ■) the Mi metric. Then, 
(yn, y) — > as n — > oo if and only if 



Vnf-F n ^0 in (V hm ,U). 
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Figure 2: An example of a A^/GJ/T queue that will undergo multiple "regime changes". The 
diffusion limit switches between a free Brownian motion (BM), a reflected Brownian motion (RBM), 
and the zero process. 

Convergence in the J\ topology can be seen as a "relaxation" of the definition of convergence 
in the uniform metric topology. Specifically, let z n , z be elements of the space £>ii m [?7, oo). Fix 
T £ [77,00) that is a continuity point of z, and let || • || be the local uniform metric on the interval 
[77, T]. Define A to be the set of all non-decreasing continuous homeomorphisms from [77, T] to itself. 
Then, convergence in J\ can be defined as follows. 

Definition 2 {Convergence in Ji). There exists a sequence {A n } C A such that \\\ n — e|| — > as 
n — > 00, where e is the identity map, dj 1 (z n , z) — > as n — > 00 if and only if \\z n o \ n — z o e|| + 
1 1 A n — e|| — > as n — > 00. 

It is well known that the Mi topology is weaker than the U (uniform) or J\ topologies, and 
processes converging in Mi need not converge in U or J\. 

As already stated, the diffusion limit for the queue length process is obtained in the space "Dii m 
when endowed with the Mi topology because the directional derivative reflection mapping lemma 
(Lemma 3) that we use yields convergence in the Mi topology alone. Intuitively, the reason the 
convergence result holds only in Mi is that asymptotically y n converges to a continuous process, 
and it is well known that continuous processes can converge to discontinuous limits only in the Mi 
topology. To concretize this intuition we give a counterexample that shows that convergence in J\ 
is not possible in this case. 

It will suffice to show that for some e > at least one of the terms in the expression dj x (z n , z) = 
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z n ° — z o e|| + || X n — e\\ exceeds e. Define the process 



y n = ^(y/nx + y) — y/n^(x) 



where x is the function in Figure 3, and y is a Brownian motion. We show that there is a non-empty 
set of points M T in the vicinity of r where the normed distance dj 1 (y n , y) > e, for any e > 0. Recall 
that y = sup sgV i(— y(s)). The next proposition formalizes this argument. 



Figure 3: This x G C[r/, oo) corresponds to the fluid netput process, when F is uniform. 

Proposition 4 (Non-convergence in J\). Let x be the function in Figure 3, {y n } C T>n m [r], oo) 
and y € C[t], oo) is a Brownian motion, such that y n ^4' y in (T>\i m [r/, oo), U). Then, the process 
y n = ^{^/nx + y n ) — y/n^(x) does not converge to y in the J\ topology as n — > oo. 

The proof is available in the Appendix. 

Thus, we see that the process y n does not converge to the directional derivative of the reflection 
map in the J\ topology (and hence even the uniform topology), necessitating the use of the M\ 
topology. This result clearly implies that Y n does not necessarily converge to Y in the J\ topology 
either. Thus, we have a situation where the limit process is discontinuous and the limit result can 
only be proved in the M\ topology. This is also true for the M t /M t /1 model, however unlike [18], 
we can also establish the failure of convergence in the J\ topology for the A^/GI/1 queue. 

4.1.2 Uniform Arrival Distribution. 

Note that X/f is a correspondence that maps each time t to the set of points (upto t) at which 
the fluid netput process is equal to its infimum at t. Theorem 2 shows that the diffusion limit to 
the queue length process is in fact piecewise continuous since Y is. We now specialize the diffusion 
limit results to the case of a uniform F with early-bird arrivals. This illustrates with greater clarity 
the discontinuous nature of the limiting processes. 

Corollary 2. Let F be the uniform distribution on [—Tq,T], where —Tq < 0. Then, 



X 




t 



f W°(F(t))-a^W(t), 
Q(t) = I (W°(F(t)) - <th*W(t)) + (-(W°(F{t)) - <t/Jw(t))) 
,0, 



Vt€ [-T ,r) 



t = T 

Vt G (r, oo), 



where r = {— Tq < t < oo | F(t) = fit}. 
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Proof. Recall from Theorem 2 that Q = X + sup sgV x(-X) where X = W° o F — a/i^W o B, and 

B is the fluid busy time process. Now, using the definition of V^, it is easy to deduce that in this 
case we have 

({-T }, Vie [-T , r), 

V? = 1{-T ,t}, t = T, 

{{t}, V tG(r,oo). 

Further, Corollary 1 yields 

B(t) = I *' ^[-Zb.r], 
\ 0, Vt€(r,oo). 

Using these facts the conclusion follows by substitution. □ 

The time r can be interpreted as the first time that the fluid service process catches up with 
the fluid arrival process. For a uniform F there is at most one such point, but in general there can 
be many such points. 

Interestingly, the nature of the discontinuity at Q(t) depends on the the sign of X(t). The 
following corollary clarifies this statement. Recall that t is a point of right- discontinuity for a 
function x £ T>n m if x is left-continuous at t, and x(t—) > x(t+). On the other hand, t is a point 
of left- discontinuity if x is right-continuous at t, and x(t+) > x(t—). 

Corollary 3. Let F be the uniform distribution over [—Tq,T\, where To > 0, and r = {—To < t < 
oo\ F(t) = fxtli t>0 y}. Then, for the process Q in Corollary 2, we have 

(i) [— To,t) U (r, oo) are points of continuity. 

(ii) t is a point of right- discontinuity, when X(t) > 0. 

(Hi) t is point of left- discontinuity, when X(t) < 0. 

The proof is available in the Appendix. 
Remarks: 1. In Corollary 2, Q is piecewise continuous on [— To,oo), with a single point of 
discontinuity at r. Interestingly, Q(t) is determined by the value of the process at r— . If Q(r—) 
is non-positive, then the value of Q(r) is zero. On the other hand, if Q{t— ) = X(t) > 0, then 
Q(t) = X(t) = Q{t—). However, at r+ the queue length immediately falls to and remains there 
forever after, as the reflection regulator map becomes positive for all time after r. 

2. A useful way to interpret the discontinuity at r in Corollary 2 is to consider the process on 
the two sub-intervals separately and try to "patch" them together. If Q(t—) = X(r) = Q(r) > 
we should expect a free diffusion path on the interval [— To,r], and a reflected process such that 
the path is on (r, oo). Furthermore, Q(t) becomes the "starting state" for the process on the 
interval (r, oo), and the reflection operator is applied an instant after r. On the other hand, if 
Q(t—) = X(t—) < we have a free diffusion on [— Tq,t) and the zero process on [r, oo), i.e., the 
process drops to zero at r. Thus, Q(t— ) provides the starting conditions for the new "regime" of 
the diffusion, as the process transitions from [— Td,t) to (r, oo). 

3. We note that in [17], a diffusion approximation to the queue length process is derived inde- 
pendently for different operating regimes. However, these limit results have not been "patched" 
together to obtain a "process-level" convergence result, which is precisely where the mathematical 



18 



challenges lie. We also note that diffusion limits in [17] do not involve directional derivative maps 
since the processes are continuous over the intervals on which they are studied. 

4. It is also pertinent to mention that the limit results in [17] are obtained in the uniform topol- 
ogy at what are the continuity points of the limit process between regime changes. However, as 
we noted above, there are discontinuities in the limit process at points such as r where regimes 
switches, precluding the possibility of establishing a "process-level" limit in the uniform topology 
and necessitating the need to establish the limit in a weaker topology. 



4.2 Virtual Waiting Time Process. 

We now establish a diffusion limit for the virtual waiting time process. We first prove a limit result 
for the diffusion-scaled busy time process. 



4.2.1 Busy Time Diffusion Limit. 

Recall from Section 3 that the fluid-scaled busy time process is given by B n (t) in (11). It was 
shown in Corollary 1 that this process converges to a continuous process B(t) as n — > oo. Define 
the diffusion-scaled busy time process as 

B n := y/n(B - B n ). (31) 

Note that from the definitions of B n {t) and B(t) it follows that B n (t) = 0, Vt < 0. The diffusion 
limit for this process is given as follows. 

Corollary 4. (i) As n — >■ oo, 

B n B := -max(-X(s)), in (P Um ,Mi). 



(ii) Let the arrival distribution F be uniform over [— Tb,T] ; where —To < 0. Then, 

Bit) = < 



'0, t€[-T , T ), 
i(0V (-*(<))), t = r, 
,i(-X(t)), t€(r,oo), 



where r = {t > 0\F(t) = fit}. 
Proof. Recall that 

B n (t) = tl {tm -I n (t). 

Substituting this and B from (16) in the definition of B n , and rearranging the expression, we obtain 

B n = -Y. 



A simple application of Theorem 2 then provides the necessary conclusion. 
The proof of part (ii) follows easily by substitution. 



□ 
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Observe that B n (t) is approximated in distribution by B as 

B n (t) ~ B(t) - -i=B(t), 
\Jn 

where X n ps X is defined to be 

F{X n <x)& F{X < x), 

and the approximation is rigorously supported by an appropriate weak convergence result. 

In the case of a uniform F, it can be seen that on [— Tq,t) the queue length in the fluid limit 
is positive. However, as the server starts at time 0, the only interesting sub-interval of [— Tq,t) is 
[0, r). Using the appropriate definitions, note that B(t) = t and B(t) = for all t G [0, r), implying 
that B n (t) = t approximately, though in the non-asymptotic regime B n (t) may be strictly smaller 
than t. 

On the other hand, the fluid queue length is zero in (r, oo) and it follows from definition of 
that B(t) = t - ±(-X(t)) = j;F(t) for t G (r, oo). Substituting this expression together with 

that of B, and expanding X, we see that 



B n [t) w t + + X (t)) = - F(t) + -^W»(F(t)) - a^W{F{t)) ) , 



where the second = is due to the fact that we used the Brownian motion scaling property. Note 
that this depends on the arrival distribution F alone. In the fluid limit of the busy time process, 
we see that B(t) = F(t)/fi which is the fraction of time from the interval [0,t] that the queue has 
spent serving. 

4.2.2 Virtual Waiting Time Diffusion Limit. 

Now, consider the centered virtual waiting time process given by 

Z n (t) = Vn(Z n (t) - Z(t)) Vt G [-T , oo), (32) 

where Z(t) is defined in (18) and Z n (t) is defined in (17). Corollary 5 proves the diffusion limit to 
this process. 

Proposition 5 (Diffusion Transient Little's Law). (i) As n — > oo, 

Z n Z := 1 Q + a^' 2 W o B - a^ 1/2 W o F in (X» Um , Mi) . (33) 



(ii) Let F be uniform over [— Tq,T], where —To < 0. Then, 
Z(t) = < 

where r := {-T < t < oo | F{t) = fit} 



' \{W\F{t)) - a^W{t)} - a^W^Fit) - »tl {t > 0} )), Vt G [-T ,r) 
J{(W°(F(r)) - <th*W{t)) + (-(W°(F(t)) - a^W(r)))+, t = r 
[0, Vt€(r,oo), 
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Proof. Expanding the definition of Z n (t) in (32), and introducing the term 



1 A"(t) 
fj, n '■ 



we obtain 



Z n (t) = ^[V n (A n (t))--^^ + -^^-^ + B(t)-B n (t)\. 



H n 



[i n 



/i 



Now, using the random time change theorem (Section 17 of [3]), Proposition 1 and Proposition 
2, we have 



[V n oA n - 



1 A n 
fi n 



-a^Wo- m(V hm ,U). 
A* 



(34) 



Using this fact, Proposition 2 and Corollary 4, it follows that 

Z n => Z - afi 1/2 W o — + -W° oF + B in (Aim, Mi). 
fi ll 

Note that W and VF° are independent processes. Adding and subtracting the process ai^^WoB 
where W is the Brownian Motion in (34), we obtain 



Z = 1 Q + ( ctjU 1 / 2 ^ oB — afi^ 2 W o - ] . 
M V A*/ 



For part (ii), the proof is straightforward and follows by using Corollary 2. 
Remarks. 1. The limit process in Corollary 5 is equal to 



I 1 \ A* / 



□ 



(35) 



Interestingly, the extra diffusion term is non-zero only when the fluid limit of the queue length 
process is positive, indicating that it arises from temporal variations in the operating regimes of 
the queue. To see this, note that the variance of the diffusion term is 



a 2 n E 



W{B{t)) - W 



a 2 n[B{t) + 



Fit) 



H n J 



where x A y := mm(x,y). Clearly, the expression on the right-hand side changes depending upon 
the ratio of the number of users arrived to the number served in the fluid regime at time t. It 
follows that 



cr 2 /i E 



W{B(t))-W 



a 2 \x\ 



Fit) 



o 2 n \ B{t) 



B(t) 



F(t) 
nB{t) 

F(t) 

»B(t) - 



> 1 



< 1. 



It is easy to see that the first condition above, F(t)/([j,B(t)) > 1, implies Q(t)/fi > 0. The second 
condition, F (t) / \llB {€)) < 1, implies Q(t) = 0. This in turn, implies 

(F(t) - Aitl {t > 0} ) + * GF(t) - At*l{t>o}) = 0. 
Rearranging this expression, it follows that F(t) = /xil{ t > }) — ^f(F(t) — /itl{ t > }). 
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Now, using the definition of B from (16) we have F(t)/ (pB(t)) = 1. It follows that the diffusion 
term is equal in distribution to the following (time-changed) Brownian Motion 

a ^/2 W fm_B(t)j =a^ 2 w(^j, Q(t)>0 
ap}' 2 W f5(t) - F f)=0, Q(t) = 0. 

This leads to expression (35). 

2. We note that (33) (or (35)) can be interpreted as a transient Little's Law that relates the diffusion 
limit of the virtual waiting time with the queue length diffusion limit process. This result is useful 
because it provides a sample path relationship between the workload and current queue state, in 
contrast to the standard Little's Law which provides a relationship between the customer-averaged 
workload and the time-averaged queue lengths. Note that the FCLT of the workload process in a 
G/GI/1 queue (see Chapter 6 of [5] for details) with arrival rate A and service rate p has the form 

Z(t) = 1 Q{t) + a^l\W{(p A l)t) - W(pt)), 
A* 

where p = X/p, is the traffic intensity function for the Gj GI/1 queue, and this is similar to (33). The 
extra diffusion term in (35) captures the variation of the workload, as the (fluid) queue transitions 
between various operating states (see Section 5 for more details). 

3. Another interpretation of the term a p l l 2 W (Q(t) / p) is that it is in fact the diffusion limit to the 
service backlog at time t, and the variation in the backlog at each point in time is captured in the 
term Q/p. Suppose that F(t) < pt then the fluid queue length process is zero and the server will 
idle, and the zero state is recurrent for the queue length process. The workload in the system (for 
most of the time when F(t) < pt) should be 0. On the other hand if F(t) = pt, whence the fluid 
queue length will be zero again but the server is not idle, it is reasonable to expect that the virtual 
waiting time is zero for an arrival at time t. However, there is a non-zero probability of the queue 
being backlogged at time t, and this fact is captured in the term j^Q/p- We discuss the various 
operating regimes mentioned here in greater detail in the next section. 



a^/ 2 (w(B(t))-W^^l< 



5 Sample-Path Analysis. 

As noted in Section 4, the limit process is piecewise continuous, with discontinuity points deter- 
mined by the fluid limit. Indeed, the discontinuity points are precisely where the fluid limit switches 
between 'overloaded', 'underloaded' and 'critically-loaded' regimes. We now provide formal defini- 
tions of these notions, in terms of the fluid limit arrival and service processes. 

We then characterize the sample path of the queue length limit process, and the points at which 
it has discontinuities. Developments in this section follow the study of the directional derivative 
process in [18]. However, the limit processes and the setting of our model is completely different. 
Thus, where necessary, we prove some of the facts about the sample paths. The operating regimes 
for the Mt/Mt/1 model in [18] and our A^/GI/1 model are quite similar, and we adapt the 
definitions to our model. 
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5.1 Regimes of Q 

It can be useful to characterize the state of a queue in terms of a "traffic intensity" measure. For 
instance, in the case of a G/G/l queue, the traffic intensity is the ratio of the arrival rate to the 
service rate. In [21], a traffic intensity function for the Mt/Mt/1 queue with arrival rate A(s) and 
service rate p,(s) was introduced as the ratio 

p*(t):= sup J t , t>0. 

o<s<t J s fjL\u)au 

Here, we adapt the form of this function and define the traffic intensity for the A^/GI/1 queue 
in terms of the fluid limit as 



Pit) :-- 



oo, Vie [-T , 0] 

^Po^^f 1 , VtG[0,f] (36) 
t0, Vt>T, 



where T := mf{t > 0\F(t) = 1 and Q(t) = 0}. Note that we define the traffic intensity to be oo in 
the interval [— Tq,0] as there is no service, but there can be fluid arrivals. It is also important to 
note that the definition of p* follows from the pre-limit system describing the arrival and service 
processes in the M t /M t /1 queue, whereas the definition of p is contingent on the establishment of 
the fluid limit processes as there is no explicit arrival 'rate' associated with the arrival process. In 
the case of a uniform F over an interval [—To, T], p is given by 

Now, consider the following obvious definitions of the operating regimes of the fluid A^/GI/1 
queue. 

Definition 3 (Operating regimes.). The A^/GI/1 queue is 

(i) overloaded if pit) > 1. 

(ii) critically loaded if pit) = 1. 

(iii) underloaded if p(t) < 1. 

Notice that these regimes correspond to the operating regimes of a time homogeneous G/G/l 
queue. However, since the queue length fluid limit in the A^/GI/1 queue can also vary with time, 
and analogous to the M t /M t /1 queue in [18], we also identify the following "finer" operating states. 
In particular, these states are useful in studying the approximation to the distribution of queue 
length process on local time scales. 

Definition 4 (Operating states.). The A^/GJ/l queue is at 

(i) end of overloading at time t if p(t) = 1 and there exists an open interval (a, t) or (t, a) such 
that p(r) > 1 for all r in that interval. 

(ii) onset of critical loading at time t if p(t) = 1 and there exists a sequence X n t t such that 
p(X n ) < 1 for all n. 
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(iii) end of critical loading at time t if p{t) = 1, and there exists a sequence X n f t such that 
p(A n ) = 1 for all n and a sequence j n \, t such that p(^ n ) < 1 for all n. 

(iv) middle of critical loading at time t if p(t) = 1, and t is in an open interval (a,b), such that 
sn Pte(a,b) P(*) ^ 1 an d there exists a sequence \ n 1 1 such that p(X n ) = 1 for all n. 

The following Lemma shows the equivalence of the definitions of the operating regimes to the 
process Q. 

Lemma 5. The A^/GI/1 queue is 

(i) overloaded at time t if Q(t) > 0. 

(ii) critically loaded at time t if Q{t) = 0, X{t) = ty(X)(t) and there exists an r < t such that 
V(X)(t) = V(X)(s) for all s G [r,t]. 

(iii) underloaded at time tifQ = 0,X(t) = ty(X)(t) and there exists anr < t such that ^(X)(t) > 
^(X){s) for all s £ (r,t). 

The proof of the lemma is in the appendix. 
5.2 Sample Paths 

We now characterize a typical sample path of the limit process Q, and show that the sample paths 
are upper-semicontinuous. Also, as noted in Corollary 2, in the case of a uniform F, the fluid limit 
appears to switch regimes at the point r which is the point at which F(t) = fit. In fact, the point r 
was a discontinuity point in the sample path of the diffusion limit. We now show that this behavior 
holds more generally and give conditions under which a particular point is a discontinuity point. 

Proposition 6. The process Q is upper-semicontinuous almost surely. It is discontinuous at time 
t, with a non-zero probability if and only ift is the end-point of overloading or critical loading. The 
set of such points is nowhere dense. 

The proof can be found in the appendix. 
Remarks. 1. We note that the queue length limit sample paths for the M t /M t /1 model are also 
upper-semicontinuous as shown in Theorem 3.1 of [18]. There the sequence of converging processes 
was shown to be monotone, which easily leads to upper-semicontinuity by Dini's Theorem. As this 
monotonicity property does not hold for the corresponding processes in the Au\/GI/l model, we 
argue that the sample path is upper-semicontinuous directly from the characterization of the points 
of continuity and discontinuity in the domain of the sample path. 

2. The intuition for the regime switching behavior proved in Proposition 6 is easy to see in the case 
of a uniform arrival distribution with early-bird arrivals. As noted earlier, in this case the (fluid) 
queue is overloaded on the interval [— Tq,t) with the singleton set Vf = {—To}, and underloaded 
on the interval (r, oo) with the singleton set = {t}. At r itself, there are two points in the set 
Vf = {-T ,r}. Thus, there is a discontinuity due to the fact that the set changes from being 
a singleton on the interval [— Tq,t) to {— Tq,t} at r. 
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6 Distributional Approximations. 



Theorem 2 proved a diffusion limit to the standardized queue length process. In this section, we 
illustrate how this limit process can be used to approximate the queue length distribution of the 
exact (pre-limit) model. That is, we have proved that 

Q n inQ + VnQ. 

d 

Here, we use the notation ~ to mean "approximately equal in distribution" , with the understanding 
that the approximation is rigorously supported by the weak convergence in Theorem 2. Our goal is 
to study this distributional approximation as Q and Q vary through the various operating regimes 
from Section 5.1. Notice that the results developed in this section are directly applicable to the 
virtual waiting process as well, due to the transient Little's Law developed in Proposition 5. More 
precisely, recall that we have 

Vn{Z n -Z)i-Q-a l x 1 ' 2 Wo9.. 

Rearranging the expression and substituting for Z from Proposition 3, and using the fact that 
Q n « nQ + yJnQ, it follows that 

d lQ n 1/2 Q 

Z a\x 1 W o e, 

[in fi 

where 

'i, VKO 



e(t) := 



0, Vt>0 



So, the distributional approximations developed in Theorem 3 can be used to study the virtual 
waiting time under the various operating regimes and states. 

Theorem 3 (Distributional Approximations). The queue length can be approximated in the various 
operating regimes as follows. 

(i) Overloaded state. An overloaded interval is given by (t*,r), where t* := supVf, and r := 
inf{s > t*\p(s) = 1}. Ift€(t*,r), then 

Q n (t)/^& Z?We (t*,T), 

where Z™ is the strong solution to the stochastic differential equation dZf 1 = \/n(f(t) — fj,)dt + 
\Jg (t)dW t V£ G (t*,r) with initial condition Z t * = X(t*) - X* , where X* := sup sgV x (-X(s)) 

and g(t) = F(t)(l - F(t)) + a 2 fi 3 B(t). 

(ii) Underloaded state. If t is a point of underloading, then p(t) < 1. This implies 

Q n (t)/V^io. 
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(iii) Middle- and End- of- critically-loaded state. A critically loaded interval (t*,r) is open where 
t* is a point in the onset of critically loaded state and r a point at the end of critically loaded state. 
Furthermore, for any t G (t*,r), let u = t — t* and we have 

Q n (t)/Vni(X(u)+ sup (-X(s))), 

0<s<u 

where X{u) = X{t) — X{t*), and X(t) is the strong solution to the stochastic integral f^ To \J ' g ' (s)dW s . 

(iv) End of overloading state. Let t be a point of end of overloading. Then, for all r > 

9 {t ~ r Wl) &(X(t)+ sup (-*(*)) -(/(t)- M )r)+ 
V" sevf\{t} 

where f(t)dt = dF(t) is continuous. 

The proof is relegated to the appendix. 
Remarks: 1. Overloaded regime, (i) The approximate distribution is Gaussian with mean F(t)—/j,t. 
However, the variance is affected by the fact that the queue may have idled in the past. This is 
due to the fact that the time-varying variance, given by g(t) = F(t)(l — F(t)) + a 2 ii ? 'B{t), depends 
upon the fluid busy time. Recall from Corollary 1 that 

Bit) = l {tm - ^(X)(t). 

(ii) We note that this result is analogous to case 5 of Section 4 in [17]. However, in [17], the author 
notes that no reflection need be applied in an overloaded sub-interval, and proceeds to derive the 
limit process (in this interval alone) as W° o F(t) — afi 3 ^ 2 W(t). This is correct but it misses the 
point that the starting state of the process in each new interval of overloading must be factored 
into the approximation. That is, while X/f is fixed for all t in an overloaded sub-interval, the value 
sup sgV x (— X(s)) provides the starting state for the diffusion in the overloaded sub-interval. It 
also missed the fact that if the queue had idled before t (i.e., it had entered an underloaded phase 
previously) then B(t) < t, and the variance g(t) will be less than g(t) = F(t)(l — F(t)) + a 2 fi 3 t. 

2. Underloaded regime. In this case, Q(t) = 0, and that = {t}. It follows that Q{t) = 
X(t) + S u PseV? (-X(s)) = 0. 

3. Critically-loaded regime. The queue length process of the Au\/GI/l queue in the critically 
loaded regime is approximated by a driftless reflected diffusion process, with starting state X(t*). 
Furthermore the diffusion limit in the critically loaded sub-interval has continuous sample paths. 
By the definition of a critically loaded state, we know that p(t) = 1 at all such points, and 
"accumulates" the points of critical loading, as t evolves through the critically loaded interval. It 
follows that the set Vf is the interval (t* , t] . 

4. End of overloading regime. As noted in the definition, a point t is an end-of-overloading if the 
traffic intensity is 1 at t, and is strictly greater than 1 at all points to the left of it. Here, we are 
primarily interested in the rate at which the queue empties out asymptotically as overloading ends. 
Consider a sequence of r n defined as a sequence of times at which the queue in the nth system first 
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empties out. Define v := t — J ^=. Then, from Theorem 3 

/(*)-/* 

Thus, it can be seen that the time at which the queue empties out converges to a Gaussian random 
variable. A similar conclusion was drawn in [17] for the setting there, and in [18] for the Mt/Mf/1 
queue. 



7 Simulations. 

We now present some simulation results to illustrate the validity of the approximations in Theorems 
1-3 as the population size increases. Consider a uniform arrival distribution over the interval 
[—20,40], with service times i.i.d. and exponentially distributed with parameter \i = 0.03. We 
simulated the A^/GI/1 queue with population sizes n = 10,25,100,1000. Figure 4(a) shows 
typical sample paths for the queue length, arrival and service processes, when n = 100. In Figure 
4(b) we compare the sample paths of the queue length process as n increases from 100 to 1000. 
Note that the fluid limit of the queue length drops to zero at t = 25, and in the vicinity of t = 25 
for the exact model, i.e., as the population size increases we see that the sample paths are closer 
to the theoretical fluid limit process. 

Figures 5(a) and 5(b) show the sample mean and the sample variance of the queue length 
process for n = 10, 25, 100, 1000 over 10,000 sample runs. Note that as n increases, the sample 
mean approaches the fluid limit, and the sample variance approaches the theoretical variance of 
the queue length process. Note that for the given F, this is 



a\t) 



F(t)(l-F(t)), Vt€[-T ,0] 
F(t)(l-F(t)) + a 2 n% Vt€(0,<r) 
[0, Vt > r. 



Also, observe from Figure 5(a) that even for small n, the sample mean is quite close to the 
fluid limit for t < 0. However, once queueing dynamics come into play, the fluid limit is a good 
approximation only for n = 100 or larger. Similarly, the diffusion limit is a good approximation to 
the sample variance even for small n. But once service starts, and queueing dynamics come into 
play, the diffusion limit becomes a reasonably good approximation only for n = 1000 or larger. 



8 Conclusions and Future Work. 

In this paper, we introduce a novel single server queueing model which we call the A^/GI/1 
queue. In this model, customers from a finite population independently choose (sample) their time 
of arrival at the queue from a common distribution function. The arrival times are, thus, order 
statistics and the inter-arrival times are differences of consecutive order statistics. Service times 
are i.i.d. with some general distribution G, and the service rate is fixed. 

This new queueing model differs from classical models in two ways. First, the arrival process 
is not specified in terms of the inter-arrival times, which are not renewal intervals. Thus, the very 
useful machinery of renewal theory cannot be used. Second, a finite population of customers arrive 
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at the queue in a fixed interval. This implies that the queue itself is extant only for a finite time, 
and this can be considered a transitory queueing model. These unique features are consequential 
to the analysis of the queue. Studying the finite model is analytically intractable. Furthermore, 
the queue itself is transitory, and thus can only be studied in the transient setting. In fact, the 
Au\/GI /l queue does not have a stationary state. 

Thus, we develop pathwise asymptotic approximations to the system performance metrics as 
the population size increases. We first derive a fluid limit to the queue length process and show that 
the fluid queue length process is the difference between the arrival distribution and the fluid service 
process, with a reflecting barrier at 0. It is also shown that the queue length process can switch 
between overloaded, underloaded, and a critically-loaded regimes. We then derive the diffusion 
limit to the queue length process in terms of the 'netput' process and a directional derivative of the 
Skorokhod reflected fluid netput in the direction of a diffusion refinement of the netput process. This 
weak convergence result is shown to hold in the M\ topology on the space T>\ lxa . In fact, we show 
that this convergence cannot hold in the stronger J\ topology, and by implication in the uniform 
topology. A novel feature of the diffusion limit is that it is a function of a Brownian bridge and a 
Brownian motion process. This appears to be a unique observation since such diffusion limits have 
not arisen in the study of conventional heavy-traffic or large population approximations to a single 
server queue. The diffusion limit process we derive switches between a free diffusion, a reflected 
diffusion and the zero process. This indicates an interesting connection with time inhomogeneous 
queues even though the setting of the A^/GI/1 queue is different. 

Our original motivation for introducing the Au\/GI/l model came from the 'concert arrival 
game', a game of arrival timing introduced in [13]. Customers choose to arrive at a queue to 
minimize an expected cost functional that depends on the waiting time and the number of people 
who have already arrived. The Nash equilibrium analysis was done in the fluid limit, and it was 
established that for linear cost functionals, the uniform arrival distribution is a Nash equilibrium. 
With a given arrival distribution, this is a A^/GI /l queueing model, and the details of the queueing 
dynamics in the game are also of interest. Another question of interest is whether the equilibrium 
derived from the fluid model approximates in any way the equilibrium of the finite population 
'concert arrival game'. Note that this is in the spirit of mean field equilibrium approximations of 
finite stochastic game models [4, 1, 30]. 

Our next step is to take the diffusion approximations for the A^/GI/l queue model, and 
revisit the 'concert arrival game' problem. In [13], the assumption is that the queue lengths are 
unobservable. Our diffusion approximations can now allow us to study other situations where the 
queue length are fully or partially observable. In the spirit of mean field game theory, this could 
be understood to be a 'diffusion field game theory'. 

The new queue model we introduce should also be of interest in other scenarios, and some 
empirical validation of the queueing model would be useful. Thus, another activity we plan to do 
in the future is to acquire data for some common queueing situations, for instance queues at lunch- 
time cafeterias, postal and DMV offices, some enterprise call centers, and check if the A^/GI/1 
queue could be a reasonable model. 
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Appendix 
Proof of Lemma 1. 

We first prove a property of the first ordered random variable. Let P be a probability measure such 
that F(x) = F(Xi < x). Now, fix e > 0, and consider the event 

E n := {u-.X^yS + e}. 

The measure of this event is 

F(E n ) = P(X ( n 1} > S + e) = P(inf{Xi, ...,X n }>e). 
Since the samples are i.i.d., it follows that 

F(E n ) = (P(Xi > S + e)) n = (1 - F(S + e)) n . 
Since F(S + e) > for any e > 0, it follows that 

oo 
n=l 

Thus, by the First Borel-Cantelli Lemma [6], it follows that P(X^ > S + e i.o.) = 0, implying that 

X ( n 1} I S a.s. (37) 
Now, fix r > 0. It is easy to see that I n (t) < Y n (t), Vt G [~T ,t]. Also, I n (t) > Y n (t) - 
J-To l{Q n (s)=o}ds, \/t G [— Tq,t]. It follows that 

/rpn 
(1) l {Qn{s)=0} ds > Y n {t) - I n {t) > 0. 
-To 

The conclusion follows by using (37) with S = —Tq. Since r is arbitrary, the lemma is proved. □ 

Proof of Lemma 3. 

Rewrite y n as 

y n = {^{^x + y n ) - $>{V^x + y)) - {^{^x + y) - Vn*(s)). 

Now, using the fact that the Skorokhod reflection map is Lipschitz continuous under the uniform 
metric (see Lemma 13.4.1 and Theorem 13.4.1 of [34]) we have 

(^(Vnx + y n ) - ^{Vnx + y)) < \\y n - y\\, 

where || • || is the uniform metric. It follows that 

Vn < \\y n ~ y\\ + {^{y/nx + y)- VnV(x)), 

Now, by Lemma 5.2 of [18] we know that as n — > oo 

(^(v^x + y) - v/n^(x)) a 4' y, in (P Um , Mi). 

Using this result, and the fact that by hypothesis y n converges to y in (£>ii m , U) we have 

y n °4 y, in (Pi im ,Mi). 

□ 
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Proof of Lemma 4. 



We prove the claim for a uniformly distributed random variable. The result for a random variable 
with an arbitrary absolutely continuous distribution function will follow by the usual transformation 
of the random variable to a uniform random variable. Let T be a uniformly distributed random 
variable with support [0, 1]. 

Let = inf{Ti,--- , T n } be the first order statistic of n i.i.d. uniform random variables. 
Then, for any e > 0, using the independence of the random variables {Tj}™ =1 , 

Here, (■)" represents a power of n. 

A standard limit result on 1 = M U {oo} states that if x n — > x as n — > oo then 

( 1 + ^) n ^ eX asrwoo. (38) 
Using the result in (38) it is straightforward to deduce that 

lim P(V^T-, (n) > e) = 0, 

n->oo 

thus proving the claim for a uniformly distributed random variable. □ 
Proof of Proposition 4. 

Recall that y n converges to y in the M\ topology, from Lemma 3. Now, consider a path of y that is 
non positive at r. Thus, the limit process y has a discontinuity at r such that y(r) > y(r+). Note 
that the process path is left continuous at r. Assume that y(r) — y(r+) > 5 > 0, and it follows 
that y(r) = —y(r) > 5 (since y(r+) = 0). Fix an e > such that 5 > e. Now, by the continuity of 
y, there exists rj > such that swp te ! T _ VT+r] -\ \y(i) — y(r)\ < |. Then, there also exists a no such 
that for all n > no, > —^Jnx(t) > — | for t G [r — n, r]. 
Then, for any i G [r — n] it follows that 

-y/nx(t)-y(t)+y(r) > -|. 

This implies that — y / nx(t) — y(t) > S — | > |, since 5 > e. It follows that ^(y^x + y) > | for 
all time points £ G [r — n, r]. Thus, it cannot be the case that uniform convergence is possible on 
any compact set of [— To,oo). Furthermore, consider any sequence {A n } C A. Then, for large n, 
by assumption, A n is uniformly close to the identity map. Thus, any distortion introduced by the 
homeomorphism will be minimal, and the same argument will show that it cannot be the case that, 
for any fixed e > 0, \\y n o A n — y o e\\ < | for large n, and there is a set of points determined by rj 
(due to the continuity of y) where it is the case that \(y n o X n )(t) — (y o e)| > |. □ 

Proof of Lemma 5. 

First, suppose Q(t) > 0. It follows that F(t) — /it > inf_y < s <t(i ? (s) — ns) = w where the latter 
equality follows because the queue starts empty at time 0, and the fluid netput is positive before 
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time (Note that we ignore the positive part operator in the definition of ^ as the systems starts 
empty at time —To). Now, let t* = sup{0 < s < t\(F(s) — fis) = mfo< s <t(-F(s) — H s )} be the point 
at which the infimum is achieved, on the right hand side. It follows that 

F{t)- fit > F(t*)-fit*, 

in turn yielding 

F(t) - F(s) 
pit) = sup ±1 > i. 

0<s<t V{t ~ S) 

Next, suppose Q(t) = 0, X(t) = fy(X)(t) and there exists an r < t such that ty(X)(t) = *(X)(s) 
for all s G [r,t\. It follows that F(t) - fit = — sup_<^ < s <£ (— (F(s) — l-^s)), implying there exists a 
point r* G [0, t] such that F(t) — fit = F(r*) - fir*. This, in turn, implies that 

F(t) - F(s) F(t)-F(r*) _ 
o<s<t V(t-r) fi(t-r*) 

However a simple contradiction argument shows that 

F(t) - F(s) 

sup -AA ±1 > 1 

o< s <t fJ-{t - r) 

is impossible, implying that 

sup F(4 > - F(S > - - 



0<s<t Kt - r ) 
Finally, consider case (iii). We have, Vr < t, 

-(F(t)-frt)= sup (-(F(s) - fis)) > sup (-(F(s)- fis)). 

-T <s<t -T,)<s<r 

It follows that —(F(t) — fit) > —(F(r) — fir), implying 

i> F(t )- F P *re[0,t). 
fi(t-r) 



□ 



Discontinuity Conditions. 

The following lemma consolidates Lemmas 6.5, 6.6 and 6.7 in [18], and we provide it for refer- 
ence. The lemma characterizes the points of discontinuity (and continuity) of the process Y(t) = 
sup s( _ v x(— X(s)) in relation to the correspondence V*. 

Lemma 6. A point t G [— To,oo) is characterized as follows. 
(i) Continuity Conditions. 
The following are equivalent: 

1. t is a continuity point. 

2. t G Vf = {t}, or t Vf , or t G Vf / {t} and t is not isolated in Vf and Vf C 

for some u > t. 
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(ii) Right-discontinuity Conditions. 
The following are equivalent: 

1. t is a point of right- discontinuity. 

2. t G Vf ^ {t} and V* C (t, u] V u > r. 

3. Y(t) = Y{t-) > Y(t+) = -X(t). 

(iii) Left- discontinuity Conditions. 
The following are equivalent: 

1. t is a point of left- discontinuity. 

2. teVf^ {t} and t is isolated in Vf . 

3. Y(t) = Y(t+) = -X{t) > Y(t-). 

A point of right-discontinuity can be seen to be left-continuous, coupled with an ordering on 
the right and left limits, such that Y(t—) > Y(t+). Similarly, a point of left-discontinuity is 
right-continuous, and the limits are ordered such that Y(t+) > Y(t—). Using these definitions, we 
proceed to prove the upper-semicontinuity of the limit process. 

Proof of Corollary 3. 

Recall that Q = X + Y , where Y(t) = sup s6V x (— X(s)). The proof of (i) follows directly from part 

(i) of Lemma 6. Next, recall from the proof of Corollary 2 that V^" = {— To, r}. Thus, r is isolated 
in the set and it follows that part (iii) of Lemma 6 is satisfied. On the other hand, recall that 
Vf = {t} C (r,t], Vt > r, and r can also be a point of right-discontinuity, by part (ii) of Lemma 6. 
Thus, r is one or the other depending on the path of X. If X(t) < then Y(t+) = Y(t) > Y(t-) 
and r is a point of left-discontinuity. Otherwise, if X(t) > 0, then sY(t) = Y(t—) = > Y{t+) 
and r is a point of right-discontinuity. □ 

Proof of Proposition 6. 

By definition, X is continuous, and it suffices to check that a sample path of the component 
Y(t) = sup sgV x (— X(s)) is upper-semicontinuous. To see this, consider the pullback of the level 

set y _1 [a, oo) = {t G [— To, oo)\Y(t) > a}. It suffices to check that this is a closed set [27]. Let 
{T n } Q {t € [— To, oo)\Y(t) > a} be a sequence of points such that r n — > r as n — > oo, where 
r G [—To, oo) is an arbitrary point in the domain of Y. Thus, if e > 0, then there exists an no € N 
such that V n > no, e > r — r n > — e. If r is a continuity point, then the conclusion is obvious. On 
the other hand, suppose that r is a left-discontinuity point. By part (iii) of Lemma 6 it follows that 
Y(t— ) < Y(t+) = Y{t). By the definition of a left-discontinuity there exits an interval [t*,r), 
where t* = supV^\{r}, on which Y is (locally) continuous. Fix S > 0, then there exists an n > 
such that if r- —t > —n, then 5 > Y(t-) — Y(t) > —5. If e is small enough, then there exists no 
such that V n > no, r- — r ra > — rj. It follows that 5 > Y{r n ) — Y(t-) > a — Y{t—), implying that 
Y{t— ) > a — 5. Since 5 is arbitrary, it follows that Y[t—) > a, in turn implying that Y(t) > 0. 
Thus, r G Y~ l [a, oo). 
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Next, suppose that r is a right-discontinuity point. Then, from part (ii) of Lemma 6 we have 
Y(t) = Y(t—) < Y(t+). Furthermore, for any u > r, we have V;f C (r, u] implying that these 
are continuity points (by part (i) of Lemma 6). Using an argument similar to that for a left- 
discontinuity, on points to the right of r, it follows that Y(t) > a. This implies that the pullback 
set Y~ l [a, oo) is closed. As {r„} is an arbitrary sequence in Y~ l {a, oo) it is necessarily true that Y 
is upper-semicontinuous. 

The proof of of the first statement of the theorem followed as a result of the nature of the 
discontinuities. A natural question to be asked is when do these discontinuities manifest themselves? 
The second part provides the necessary answer. As the proof is similar to that of Theorem 3.1 in 
[18] we do not repeat it here. □ 

Proof of Theorem 3. 

(i) Overloaded regime. 

Proof. First, note that r is the first instant of an end of overloading phase, and the current over- 
loaded phase ends at r. In the overloaded state we have Q(t) > 0, implying that ty(X)(t) is a 
constant. Using the definition of Vf it follows that *(X)(t) = -X(t*), and Q(t) = X{t) - X(t*). 

Next, from Theorem 2, Q n (t) « nQ(t) + y/n{X{t) + sup seV;? {-X(s))). Let = y/nQ(t) + (X(t) + 

sup sgV x{— X(s)). Now, X(t) — X(t*) = J t * yj g'(s)dW s is a diffusion process that starts from at 

t*. Noting that does not change on the interval (t*,r), it follows that X* = sup s6 y{— X(s)) is 
a fixed random variable, and Z™ has an initial condition Zfi. = X(t*) — X* . □ 

(ii) Underloaded regime. 

This result is immediate from the definition of the limit processes, 
(ii) Middle- and End-of critically-loaded state. 

Proof. For any t G (t*,r) we have Q(t) = 0. From the weak convergence result in Theorem 2 we 
have 

Q n (t) w nQ(t) + y/nQ(t), 
and expanding the definition of Q it follows that 

Q n (t) « V^(X(s) + sup (-X(s))). 

se vf 

Using the fact that ^(X)(t) = w = —X(t) V t G (t*, r) in a critically loaded regime, it follows that 
Vf = (t*,t] for t€(t*,r). Thus, we have 

Q n (t) w V^(X(s) + sup (-X(s))). 

t*<s<t 

Let u = t — t*. Then, after a change of variables we obtain 

Q n (u + t*) w yfti(X(u + t*)+ sup {-X(s))). 

0<<s<u 
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Now, using the strong Markov property of a Brownian motion ([14]) we see that X(u+t*) — X(t*) = 
X(u). Substituting this into the expression above we see that we have, 

Q n (u + t*) = Q n (u)+X(t*), 

where X(t*) is the starting state of the process in the middle-of-critically loaded state. A similar 
argument will hold for the end-of-critical loading state as well. □ 

(iii) End of Overloading state. 



Proof. By definition we have 
Q n (t — ) 



y/n ^fn y/n y/n ' y/n 

Now, using the fact that F is absolutely continuous a derivative exists. Using the mean value 
theorem we know that there exists a point t £ [t — r/y/n,t] such that F(t — = Fit) — /(f) -^=. 

Adding and subtracting the term /(t)r/- v /n to the expression above we have 

p(t - J ? =) = m - mj= + - mj=. 

Substituting this into the expression for Q n above, and adding and subtracting the term X(t), we 
obtain 

Q n {t — ) 

' = X n (t - - X n (t) + X n (t) + Jn{F{t) - at) - (f{t) - u)t 

in Jn 



+*(X n + JnX){t ~^=) + (f(t) - f$))^=. 

v n v n 



Now, using Lemma 2 and the continuity of the limit process we see that X n (t — — X n (t) =>■ 
0. Further, since / is bounded by virtue of being defined on a finite interval we have r(f(t) — 
f(i))/y/n ->• oo as n ->• oo. Next, consider the term Z(t) := X n (t) + y/n(F(t) - fit) + ^{X n + 
y/nX){t— ^). Let 5 > be sufficiently small, so that the following decomposition of the expression 
above holds, 

Z n (t) = sup (X n (t) + y/n(F(t) - fit) - X n (s) - VnX(s)) 

-T <s<t-S 

V sup {X n (t) + ^{F{t)- at)-X n {s)- y/nX(s)), 
t-8<s<t-A= 

— — vn 

where t* = sup{V^\{i}}. Consider the first term on the RHS above, and call it Zf(t). Since the 
queue is overloaded before t no points are "added" to the correspondence V^, it follows from the 
definition of an end of overloading set that (F(t) - at) = -^>(X)(t) = -^(X)(t* + 5). This, in 
turn, provides Zf(t) = X n (t) + V(X + s/n~X)(t* + 5)- y/nV(X)(? + S). Using Lemma 3, it follows 
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that Zf(t) =>- X(t) + sup seV x\j t j(— X(s)) as n — > oo, followed by letting (5—^0. Next, consider the 
second term 

Z$(t) = sup (X n (t) + y/n(F(t) - fit) - X n (s) - y/KX(s)) 
t-5<s<t-^= 

— — yn 

< sup (X n (t) - X n (s)) + sup s/n(X(t) - X(s)) 

t-8<s<t-^= t-5<s<t-^= 

— — ^/n — — ^/n 

< sup {X n (t)-X n (s)) + sup yfii(X(t)-X( 8 )). 

t-S<s<t t~S<s<t-^= 

— — — — Vn 

For large n, as the queue is overloaded at t — ^ it follows that 

Z%(t)< sup (X(t)-X(s)) + y/ii(X(t)-X(t-^=)). 
t-S<s<t V n 

Using the fact that F is absolutely continuous, it follows from the mean value theorem that 
y/E(X(t)-X(t-^=)) = y/n{F(t) — F(t — n~j=) 

= y fr(f(t)- fl )^- + (f(t)-f{i))T, 

Jn 



where t € [t — r / 'y/n,t\. Since, t — > t as n — > oo, by continuity of / it follows that F(t) — f(i) — > 
as n — > oo. Then it follows by an application of Lemma 2 (and using the Skorokhod's almost sure 
representation) that 

m n .+ 0O Z2{t)<X{t)+ sup (-X(s)) + (f{t) - fi)r. 

t-S<s<t 

On the other hand, for a lower bound, using the mean value theorem again, we have 

zm > x n (t) - x n (t --!=) + (/(t) - v) T + {f{t) - f(t))r. 

Once again, using the continuity of /, the almost sure representation theorem and Lemma 2, and 
noting the continuity of the limit process X, we have 

Hm^i^i) > {f{t)-fi)r a.s. 

Now, using the limits derived for Zy and ZV; it follows that 

Q [t ~^- -(/(*) - M )r+ sup (X(t)-X(s))V(f(t)-»)T 

s evf\{t} 

= [ sup (X(t)-X(s))-(f(t)-fi)r} + . 
sevf\{t} 

□ 
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(a) Typical sample paths of the arrival, service and (b) A comparison of the scaled queue length pro- 
queue length process. cess for n = 100 and n = 1000 with the fluid limit. 



Figure 4: Sample path comparison for uniform F over [—20, 40] and exponential service with rate 
H = 0.03. 
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(a) Sample queue length process mean for n = (b) Sample queue length process variance for n = 
10,25,100,1000, averaged over 10000 simulation 10, 25, 100, 1000, averaged over 10000 simulation 
runs. runs. 

Figure 5: Mean and variance envelopes for F uniform over [—20,40], and exponentially distributed 
service times with rate ji = 0.03. 
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